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the persistent observables in the MG and the distribution of history frequencies. 
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1. Introduction 

Minority Games (MG) pQ are simple and transparent models which were designed 
to increase our understanding of the complex collective processes which result from 
inductive decision making by interacting agents in simplified 'markets'. They are 
mathematical implementations of the so-called El Farol bar problem [2|. Many versions 
of the MG have by now been studied in the literature, see e.g. the recent textbook [3] 
for an overview. They differ in the type of microscopic dynamics used (e.g. batch versus 
on-line, stochastic versus deterministic), in the definition of the information provided 
to the agents (real-valued versus discrete, true versus fake market histories) and the 
agents' decision making strategies, and also in the specific recipe used for converting 
the observed external information into a trading action (inner products versus look-up 
tables). Models with 'fake' market histories (proposed first in [I]), where at each point 
in time all agents are given random rather than real market data upon which to base 
their decisions, have the advantage of being Markovian and were therefore the first to be 
studied and solved in the theoretical physics literature using techniques from equilibrium 
[3 EE 13 Ej and non-equilibrium E3 HH H2] statistical mechanics. 

After [4. had revealed the similarity between the behaviour of the volatility in the 
standard MG models with real versus fake market histories, it was shown via numerical 
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simulations that this statement did not extend to many variations of the MG, such 
as games with different strategy valuation update rules ^3] o r with populations where 
agents do not all observe history strings of the same length ^3]. Furthermore, even in the 
standard MG one does find profound differences in the history frequency distributions 
(although there these differences do not impact on observables such as the volatility 
or the fraction of 'frozen' agents). A partly phenomenological attempt at analyzing 
quantitatively the effects of true history in the MG was presented in and followed 
by a simulation study ^Bj of bid periodicities induced by having real histories. After 
these two papers virtually all theorists restricted themselves to the exclusive analysis of 
MG versions with fake histories, simply because there is no proper theory yet for MG 
versions with real histories, in spite of the fact that these are the more realistic types. 

There would thus seem to be merit in a mathematical procedure which would allow 
for the derivation of exact dynamical solutions for MGs with real market histories. 
The objective of this paper is to develop and apply such a procedure. Models with real 
market histories are strongly non-Markovian, so analytical approaches based on pseudo- 
equilibrium approximations (which require the existence of a microscopic Lyapunov 
function) are ruled out. In contrast, the generating functional analysis (GFA) method of 
[T7] , which has an excellent track record in solving the dynamics of Markovian MGs, will 
turn out to work also in the case of non-Markovian models. There are two complications 
in developing a GFA for MGs with real histories. Firstly, having real histories implies 
that no 'batch' version of the dynamics can be defined (since batch models by definition 
involve averaging by hand over all possible histories). Thus one has to return to the 
original on-line definitions. Secondly, the temporal regularization method [TH] upon 
which one normally relies in carrying out a GFA of on-line MG versions is no longer 
helpful. This regularization is based on the introduction of random durations of the 
individual on-line iteration steps of the process, which disrupts the timing of all retarded 
microscopic forces and thereby leads to extremely messy equations! . Thus, one has to 
develop the GFA directly in terms of the un-regularized microscopic laws. 

This paper is divided into two distinct parts, similar to the more traditional GFA 
studies of MGs with fake market histories. The first part deals with the derivation of 
closed macroscopic laws from which to solve the canonical dynamic order parameters 
for the standard (on-line) MG with true market history. These will turn out to be 
formulated in terms of two effective equations (rather than a single equation, as for 
models with fake histories): one for an effective agent, and one for an effective overall 
market bid. These equations are fully exact. The second part of the paper is devoted to 
constructing solutions for these effective processes. In particular, this paper focuses on 
the usual persistent observables of the MG and on the distribution of history frequencies, 
which are calculated in the form of an expansion of which the first few terms are derived 
in explicit form. The final results find excellent confirmation in numerical simulations. 

| Note that in models with fake histories there are no retarded microscopic forces, so that there this 
particular problem could not occur. 
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2. Definitions 

2.1. Generalized Minority Game with both valuation and overall bid perturbations 

In the standard MG one imagines having N agents, labeled by i — 1, . . . , N. At each 
iteration step £ £ {0, 1, 2, . . .} of the game, each agent i submits a 'bid' bi{£) £ {— 1, 1} 
to the market. The (re-scaled) cumulative market bid at stage £ is defined as 

1 N 

A(t) = ^=J2Ut) + M?) (i) 

v iV i=1 

An external contribution A e (£) has been added, representing e.g. the actions of market 
regulators, which will enable us to identify specific response functions later. Profit is 
assumed to be made by those agents who find themselves subsequently in the minority 
group, i.e. when A(£) > by those agents i with h{£) < 0, and when A(£) < by 
those with bi{£) > 0. Each agent i determines his bid bi(£) at each step £ on the basis 
of publicly available information, which the agents believe to represent historic market 
data, here given by the vector A(£, A, Z) £ { — 1, 1} M : 

/ sgn[(l - C)A(t - 1) + CZ(t, 1)] \ 



X(£,A,Z) 



(2) 



V Bgn[(l-C)A(£-M) + (Z(£,M)] J 

The numbers {Z(£,X)}, with A = 1, . . . , M, are zero-average Gaussian random variables, 
which represent a 'fake' alternative to the true market data. M is the number of iteration 
steps in the past for which market information is made available. We define a = 2 M /N, 
and take a to remain finite as N — > oo. The parameter ( £ [0,1] allows us to interpolate 
between the cases of strictly true (( = 0) and strictly fake (( = 1) market histories. We 
distinguish between two classes of 'fake history' variables: 

consistent : Z(£, A) = Z{£ - A), (Z(£)Z(£')} = K 2 5 ee (3) 
inconsistent : Z(£,X) all independent, (Z(£, X)Z{£\ A')) = k 2 <5«'<5aa' (4) 

We note that (jlj) does not correspond to a pattern being shifted in time, contrary to 
what one expects of a string representing the time series of the overall bid, so that the 
agents in a real market could easily detect that they are being fooled. Hence seems 
a more natural description of fake history. Although fake, it is at least consistently so. 

Each agent has S trading strategies, which we label by a = 1, . . . , S. Each strategy 
a of each trader i consists of a complete list R m of 2 M recommended trading decisions 
{R 1 ^} £ {—1, 1}, covering all 2 M possible values of the external information vector A. 
We draw all entries {R 1 ^} randomly and independently before the start of the game, with 
equal probabilities for ±1. Upon observing history string X(£,A,Z) at stage £, given 
a trader's active strategy at that stage is aj(£), the agent will follow the instruction of 
his active strategy and take the decision bi(£) = . To determine their active 

strategies a,(£), all agents keep track of valuations Pi a {£), which measure how often and 
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to what extent each strategy a would have led to a minority decision if it had been used 
from the start of the game onwards. These valuations are updated continually, via 

Pia (£ + 1) = p ia (£) ~ -^A(£)Rl (w) (5) 



The factor fj represents a learning rate. If the active strategy Oi(£) of trader i at stage I 
is defined as the one with the highest valuation Pi a {€) at that point, and upon writing 
^F\[£, A, Z] = yaN 5^ A z y our process becomes 



Pia{£ + 1) = Pia(i) ~ 



^)E^ A [M^] (6) 



a(£) ^eM + T^EE^SM^] (7) 

a,i(£) = arg max APia(£)} (8) 

ae{l,...,S} 

We note that (aATp 1 £ A 1 = {aN)' 1 £ A f\[£, A, Z] = 1. The standard MG is 
recovered for £ — > (i.e. true market data only), whereas the 'fake history' MG as 
in e.g. |Hin| is found for ( — > 1 (i.e. fake market data only, of the inconsistent type Q). 

Henceforth we will restrict ourselves to the simplest case S = 2, where each agent 
has only two strategies, so a £ {1,2}, since the choice made for S has been shown to 
have only a quantitative effect on the behaviour of the MG. Our equations can now be 
simplified in the standard way upon introducing the new variables 

qi(£) = \\pn(£)-Pi2(£)} (9) 

= ^[R il + R i2 }, C = i[R' ;i -R i2 ] (10) 

and Q = N" 1 ^ 2 Y,i u l . The bid of agent i at step £ is now seen to follow from 

= w i + sgn[ ft (^ (11) 

The above 5* = 2 formulation is easily generalized to include decision noise: one 
simply replaces sgn[gj(£)] — > a[qi(£), Zi(£)], in which the {zj(£)} are independent and 
zero average random numbers, described by a symmetric and unit-variance distribution 
P(z). The function o~[q, z] is taken to be non-decreasing in q for any z, and parametrized 
by a control parameter T > such that a[q, z) £ { — 1, 1}, with \im T ^ a[q, z] = sgn[g] 
and limT^oofdz P(z)a[q, z] = 0. Typical examples are additive and multiplicative 
noise definitions, described by a[q, z] = sgn[g + Tz] and a[q, z] = sgn[g] sgn[l + Tz], 
respectively. The parameter T measures the degree of randomness in the agents' decision 
making, with T = bringing us back to a,i(£) = arg m&x a {pi a (£)} , and with purely 
random strategy selection for T = oo. 

Upon translating our microscopic laws (|fi|7j) into the language of the valuation 
differences (JHJ) for S = 2, we find that now our MG equations close in terms of our 
new dynamical variables {qi(£)}, so that perturbations of valuations (again for the 
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purpose of defining response functions later) can be implemented simply by replacing 
9i(-0 — * <7i(-0 + with 9i(£) G H. Thus we arrive at the following closed equations, 
defining our generalized S = 2 MG process: 

j = > <i i > \ — , 



A{£) = A e (£) + 
F\[£,A, Z] = Vc^X 6 

\(e,A,z)- 



(12) 

]T {fi A + -1= 2 *(*)]f A }.F A [*, A, Z] (13) 

(14) 

(15) 



/ sgn[(l-C)A(£-l) + C^,l)] \ 



V sgn[(l-()A(£-M) + (Z(£,M)} J 

The values of {A(£), Z(£)} for £ < and of the g^(0) play the role of initial conditions. 

The key differences at the mathematical level between MG models with fake 
history and those with true history as defined above, are in the dependence of the 
microscopic laws on the past via the history string {A(£ — 1), ... , A{£ — M)} occurring 
in \(£, A, Z) G {—1, 1} M , in combination with the presence and role of the zero-average 
Gaussian random variables {Z(£, A)}. 



2.2. Mathematical consequences of having real history 

In all generating functional analyses of MGs which have been published so far, the choice 
C = 1 eliminated with one stroke of the pen the dependence of the process on the history 
{A{£ -1),...,A{£- M)}. The variables {Z{£, 1), . . . , Z(£, M)} could subsequently be 
replaced simply by integer numbers //, labeling each of the 2 M = p = aN possible 
'pseudo-histories' that could have been drawn at any given time step £. Here this is no 
longer possible. The variables {Z(£, A)} now play the role of random disturbances of the 
true market history as perceived by the agents, and there is no reason why all possible 
histories should occur (let alone with equal frequencies) or why some entries {Z(£, A)} 
(e.g. those with small values of A, which corrupt the most recent past in the history 
string) could not be more important than others. The problem has become qualitatively 
different. One can thus anticipate various mathematical consequences for the generating 
functional analysis of introducing history into the MG. An early appreciation of these 
will help us to proceed with the calculation more efficiently 

Firstly, we will have to analyze the original on-line version of the MG; the batch 
version can no longer exist by definition, since it involves averaging by hand over all 
possible 'histories' at each iteration step. However, the temporal regularization method 
of ^Hj which was employed successfully for the on-line MG with fake history ^Hj , based 
on introducing Poissonnian distributed real- valued random durations for the individual 
iterations in ()12I13|) . can in practice no longer be used in the non-Markovian case. 
The reason for this is the problem which prompted the authors of [10J to add the 
external perturbations 6i(£) to the regularized on-line process rather than to the original 
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equations: whereas in a Markov chain the introduction of random durations for the 
individual iteration steps only implies a harmless uncertainty in where we are on the time 
axis, in a system with retarded interactions one would generate very messy equations. 
We must therefore proceed with our process as it is, without temporal regularization 
(although we will be able to recover the previous theory in the limit ( —>■ 1, as it 
should). It will in fact turn out that the more direct application of the generating 
functional method presented in this paper brings the benefit of greater transparency. 
For instance, the continuity assumptions underlying our use of saddle-point arguments 
in path integrals become much more clear than they were in jTUj . As always we continue 
to concentrate on the evaluation and disorder averaging of the generating functional 

Z[t(>] = ( e , S ( >oE,^W ff feWAW]) (16) 

The brackets in (fTBj) denote averaging over the stochastic process (jl2H3|) . whose 
randomness is here caused by the decision noise {%{£)} and the fake history variables 
{Z(£, A)}. Although (j!6|) looks like the corresponding expressions for batch MGs, here 
we have to allow for i = O(N). Studying the un-regularized process also implies that one 
has to be more careful with finite size corrections. This has consequences in working out 
the disorder average of the generating functional: in previous MG versions one needed 
only the first two moments of the distribution of the strategy look-up table entries. 
Here, although one must still expect only the first two moments to play a role in the 
final theory, the need to keep track initially of the finite size correction terms implies 
that our equations simplify considerably if, instead of binary strategy entries, we choose 
the variables {R 1 ^} to be zero-average and unit-variance Gaussian variables. 

It will turn out that in our analysis of an important role will be played by the 
following quantity: 

W[£, A,Z] = -^J2 F% A, Z\T[t\ A Z] 

= ^X(i,A,Z),X(£',A,Z) (1^) 

This object is a function of the paths {A} and {Z}, and indicates whether or not the 
histories as perceived by the agents at times i and £' are identical (irrespective of the 
extent to which these 'histories' are true). Its statistics are trivial in the absence of 
history, but will here generally contain information regarding the recurrence of overall 
bid trajectories. For reasons of economy we will formulate our theory in terms of 
the quantity (JEj), rather than substitute a z) X(i' a z) directly. This will prevent 
unnecessary future repetition, since it will allow for most of the theory to be applied 
also to MG models with inner product rather than look-up table definitons for the 
agents' history-to-action conversion [THj . 
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3. The disorder averaged generating functional 

3.1. Evaluation of the disorder average 

Rather than first writing the microscopic process in probabilistic form, as in [10], we 
will express the generating functional (fTT)|) as an integral over all possible joint paths 
of the state vector q and of the overall bid A, and insert appropriate ^-distributions to 
enforce the microscopic dynamical equations l)12|13J) . i.e. 

^A(£)[A(l)-A e (£)-^= J2\ { n \+vw E, ^iiMMm i x }^x [e ' A ' z] 

(since our microscopic laws are of an iterative and causal form, they have unique 
solutions). To compactify our equations we will use the short-hand Sj(£) = c Zi{£)]. 
We can now write the disorder average Z[ij}] of (fTBj) as follows: 




-p-j- dA(£)dA(£) c iA(i)\A(£)-Ae(D] 
to ^ 



n 



dqi(£)dqi(£) ... {e) [q . w _ e . W] +#j Wsj ( £) 



2tt 



;is) 



x e 



jvys Ea Ej 



Wi(W\A(£)-A({)lu\+si{l)e 



r x [t,A,z\ 



{z,Z} 



The brackets (. . -){ z ,z} denote averaging over the Gaussian decision noise and the 
pseudo-memory variables, and we have used the abbreviations (I10J) . The short-hand 
Du = (27r) _ 5e - ^ 2 and the previously introduced quantity W[. . .] in (fTTj) allow us to 
write the disorder average (over the independently distributed zero-average and unit- 
variance R 1 ^) in the last line of (fT8j) as 



x nn j Dv e ' 

A » 



= e -4iv E«/> w[i,i';A,z] J2i[m(i)A(e)-Mi>i(e)] [m{e)M*)-Me)'i(n] 

We next isolate the usual observables L(£,£') = N' 1 E* ft(^) ^(40 = 
N~ l Y,iSi(£)qi(£'), and C(£,£') = N -1 J2i Si(£)si(£'), by inserting appropriate integrals 
over 5-distributions. We also use the abbreviations PC = n«'[v^V^C(£, £')/y/2n] 
(similarly for other two-time observables) and Z^4 = Y\ e>0 [dA(£) / \^2n] (similarly 
for A). Initial conditions for the ^(0) are assumed to be of the factorized form 
Po(q) — ]liPo(9i(0)). In anticipation of issues to arise in subsequent stages of our 
analysis, especially those related to the scaling with N of the number of individual 
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iterations of the process, we will also define the largest iteration step in the generating 
functional as £ ma , x . All this allows us to write Z[ij)\ in the form 

Z\$\ = JVCVCVKVKVLVL e iN?2 e *ieVf)CW)+&(if)K(twLWLW)} 
x e o(e ax i°gJV) J V AVA e l ^c A(e)[m ~ Me)] 

X e ^E«' W[t/-A,Z]{A{i)K{i/')A{i')+A{l')K{i',i)A{i)} 

x e -|Ea'^. / '! ji . z ]{« a - A W r (^)^)+^W[i+^/)]^(<')} 

dqi(l)dqi{l) {e) [q . ^ +1 y gi . ^) Sj (*) 
2tt 



with 



and with 



x 



n 



•IWft(o)) 



x TT e -<Ea^ i (^)ftW&(0+*(<.0«iW4i(^)+^/ / )«iW«<('')} 
i l 

J VCVCVKVKVLVL e N\*+n+*]+o(P m io S N) 



{z,Z} 



$ = i £ [c(£, £')C(£, £') + £')K(£, £') + L(£, f )L(£, £') 



x e 



/{Z} 



(20) 
(21) 

(22) 







1 

N 



n 

£=0 



dq{£)dq{£) 
2^ 



Po(g(o)) 



x e ^ e < e 



[«Wb(/+i)-*iW-tf i W]4^(<)«Tb(<) ) *w]]-iX)«'<< In « 4 W £ ^' / 'WW 



x e 



(23) 



M[£, f ; A, i] = ?? 2 A(£)L(£, f )A{£' ) - 77[i(l)K(^, f ) A(f ) + A[£')K[£\ £)A{£)\ 

+ A(£)[1 + C(£,£')}A(£') (24) 

The 0(£^ ax log A^) corrections in (|2U|) are constants, which reflect the scaling with TV 
used in defining the conjugate order parameters. 

Compared to the Markovian (fake history) MG versions, we note that \l/ and Q take 
their conventional forms, and that all the complications induced by having true market 
history are concentrated in the function $[C, K, L], which is now defined in terms of a 
stochastic process for the overall bid A{£) rather than being an explicit function of the 
order parameters (which had been the situation in all fake history versions of the game), 
and in the remaining task to implement an appropriate scaling with N of the time scale 
•^max- We can now also see the advantage in our earlier decision to define Gaussian rather 
than binary look-up table entries. With the iV-scaling of £ max still pending, instead of 
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(|19|). in the binary case we would have found 



x e L 



(25) 



In this expression we see that, for £ max = O(N), the different choices of strategy look-up 
table entry distribution will give the same results only for those paths {A, Z} where 
the frequency of occurrence each of the 2 M possible histories is of order 0(N~ 1 ). In 
the latter case the function A, Z] scales effectively inside summations over i as 

A, Z] = 0(Ns), and we return to (HHJ). Thus, for non-Gaussian distributions of 
the {R 1 ^} at this stage of the GFA one either has to carry on with the more complicated 
expression ([25)1. which cannot be expressed in terms of the order parameters {C, K, L}, 
or one has to make further assumptions on the overall bid statistics, which (although 
turning out to be correct) require validation a posteriori. 

3.2. Canonical time scaling 

For the on-line MG with random external information (i.e. with ( = 1) it is known 
that the relevant time scale is £ max = O(N). Rather than imposing the time scale 
f max = O(N) by hand, it is satisfactory to see that one can also extract this canonical 
time scaling from our present equations ()2()J21J22p23j) . 

For finite £ max we immediately find liniAr^oo $ = in (jUJ), and our generating 
functional will be dominated by the physical saddle-point of liniAr_^ 00 [ , I' + Q], giving 
C = K = L = 0. This leads to a trivial effective single spin problem, which just 
describes a frozen state. This makes perfect sense in view of our definitions (|12|13|) : 
individual updates of the variables are of order AT", so nothing can change on 
time-scales corresponding to only a finite number of iteration steps. Thus our present 
equations automatically lead us to the choice £ max = 0(1/ Sn), where liniAr__ +00 5n = 0; the 
function $ will indeed scale differently as soon as £ max is allowed to diverge with N. We 
thus define £ max = t m a X /<5iv, where < t max < oo (of order A^ ) and with linijv^oo 5n = 0. 
In order to obtain well-defined limits at the end in (|2ip. we see that we have to re-scale 
our conjugate order parameters according to (C,K,L) — > 5n 2 (C,K,L). Furthermore, 
for the perturbation fields {9i,ipi\ to retain statistical significance they also will have 
to be re-scaled in the familiar manner, according to (9i,ipi) — ► Sjf (0i,ipi) (similar to 
[Til]). The integrations over order parameters and conjugate order parameters in (|2~U|) 
will now become path integrals for A^ — > oo§. 

It will be convenient to introduce the following effective measure: 

1 ^ fU tT /5N [dq(i)dq(£)} j, z}]g[{q, q, *}]), 

fUti /SN [dq(£)dq(£)}(Mi[{q, q, z}]). 
§ This is the point, therefore, where the inevitable continuity assumptions regarding our macroscopic 
dynamic observables enter. In the present derivation these take a more transparent form than in |10|. 
where they were hidden inside the details of the temporal regularization. 



( 9 [{ q , q, *}])* = lim ± V J n ^ m ^mm^g^nyixq.q^w^ (26) 
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Mi[{q,q,z}\ =p (q(0))e L w J ^ 



x e 



-r 2V^*max/f5jV 



L(£/)9(f)g(<')+A'(^') CT [ g (€),z(0]g(£')+6(</) t r[ g (f) )2 (£)] < r[ 9 (<'), Z (<')] 



(27) 

Upon substituting £ max = i m ax/<W into our equations ()21I22I23|) . followed by appropriate 
re-scaling of the conjugate order parameters, these three functions acquire the following 
form (modulo irrelevant constants): 

$ = Z S N 2 Yl [ c (^ 0<?(4 £') + K(£, £')K(£, £') + L{£, £')L(£, £')] (28) 

«'<tmax/<5]V 

$ = ^ lo &(J VAvA ^ A \ Z }) {z} ( 29 ) 
fi = ^E lo g/ II [*K*)d$0] (M,[{g,g,z}]) z (30) 



£=1 



with 



WL4, i|Z] = e ^^i x/% MmW-M^-k El7*i SN w[iAA,z\M[tf;A,A] ^ 

It is clear that \I/ and f2 now have proper iV — > oo limits. The canonical choice of 5jv is 
subsequently determined by the mathematical condition that lini/v^oo $[C, f^, L] ^ 0, 
but finite. It follows that (J2UJ) is again dominated by its physical saddle-point, and we 
are nearly back in familiar territory. 

3.3. The saddle point equations 

In order to eliminate the fields {ipi(£), 9i(£)}, and thereby simplify our equations, we next 
extract the physical meaning of our order parameters from the generating functional by 
taking appropriate derivatives with respect to these fields. This gives 



C(l,t) = lim -J2(s l (£)s l (£')) = (a[q(£),z(£)]a[q(£'),z(£')]U (32) 

N— >oo iv 



G ^<") = j) ? ^me) = -^(D.mmU (33) 

=^^m£m = - mm} ' (34) 

Thus at the physical saddle-point of (}20|) we have the usual relations L(£,£') = and 
K (£,£') = iG(£,£'), where G denotes the single-site response function. Upon varying 
{C, K, L) in (J2Uj) we reproduce self-consistently the by now standard equations 

C(£, £') = (a[q(£), z(£)]a[q(£'), z{t)])* (35) 

G(£,£')= -i(a[q(£),z(£)}q(£% (36) 

L(£,£') =(q(£)q(£')}* = (37) 

We turn to variation of the order parameters {C, K, L} in \I/ + $ (as Q only depends 
on the conjugate order parameters). In working out derivatives of $ we observe that 
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the conjugate bids effectively act as differential operators, i.e. A(s) — > id/dA e (s). This 
gives us our remaining three saddle point equations: 



if 



JVAVA W[A, A\Z] W[s, s'; A, Z] 



AN5 N 2 {JVAVA W[A, A\Z}) 

JVAVA W[A, A\Z] W[s, s'; A, Z\A{s') 



= nm 7^" j '^g^^Im (39 ) 

iv-oo 2N5 N 2 (JVAVA W[A, A\Z\) z 



( JVAVA W[A,A\Z] W[s,s';A,Z]A(s)A(s') /i/[ 

Lis, s') = lim ^ -, s s r ^ (40) 

tf-oo 4A^v 2 (/PAPA w[A ^]) m 

At the physical saddle-point, we may use L = and the symmetry of W[. . .] to simplify 
the function M[£, £'; A, A] which occurs in the measure (|3T|) to 

M[£, £'■ A, A] = A{£) [1 + C(£, £')]A(£') - 2if)A{£)G{£, £')A(f) (41) 

The generating fields {ipi(£)} are now no longer needed and can be removed. The 
perturbations Qi are still useful for calculating the response function G, but can be 
chosen site-independent, i.e. = 0(1). The measure (J27)l will then lose its site 

dependence. Also the functions have at this stage become obsolete. We may 

define a new time t = £Sn = O(N ), which will be real- valued as N — > oo, and we may 
take the limit N — ► oo in the definitions of our observables. The latter can subsequently 
be written in terms of the new real- valued time arguments, C (£,£') — > C(t,t') (and 
similar for the other kernels). 

4. The resulting theory 

4-1. Simplification of saddle-point equations 

We may now summarize our saddle-point equations for {C, G} in the usual compact 
way, in terms of an effective single agent process: 

C(t, t') = (sgn[g(t)]sgn[g(t')]>* G(t, t') = -i(sgn[q(t)]q(t% (42) 

with a measure which is defined in terms of path integrals, as in JU] (and with time 
integrals running from t — to t = i max ): 

/ rr . m _ ${dqdq} (M[{q,q,z}] g[{q,q,z}]) a 

* [{q > q > Z ™*- J{dqdq}(M[{q,q,z}]) z ^ 

M[{q, q, z}] = po(g(0)) e^ dt «*)[&«(*)-'(*)- J 'dt> &{t> \t)a[ q (t'),z(t')]} 

x e -ijdtdt'[L(t,t')mm+c(t,t'Mq(t),z(t)Hq(t')At')}] (44) 

To find the kernels {C, K, L} we have to evaluate equations ()38l39l40j) further, 
remembering that the left-hand sides as yet still involve the integer time labels (s,s'), 
rather than the continuous times. Now the scaling chosen for 5n with iV which we 
adopt will be crucial. We observe that all complications are contained in the evaluation, 
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for large N and for any given realization of the fake market information path {Z}, of 
objects of the following general form (with all operators evaluated at the saddle-point): 

(45) 



(Q[{A}]){A\z } = jVAVA W[A,A\Z] Q[{A}] 

We can confirm, by repeating the steps taken in evaluating the disorder-averaged 
generating functional Z[i/j] but now for calculating averages of arbitrary functions of 
the overall market bid path {A}, that the physical interpretation of the measure (J45|) is 



lim (Q[{A}}) 

N— >oo 



(Q[{A}]) {A \z } 



{Z} 



(46) 



Thus (|43j) defines the asymptotic disorder-averaged probability density for observing a 
'path' {A} of global bids, for a given realization of the fake history path {Z}. To evaluate 
(|43|) we introduce two path-dependent matrices G[A, Z] and D[A, Z], with entries 

G[A, Z] (£, £') = W[£, £'; A, Z]G{£, £') (47) 
D[A, Z](£,£') =W[£,£';A,Z}[1 +C (£,£')] (48) 

Definition (JT7|) tells us that G[A, Z)(£, £') = G(£, £') if the 'history' observed at stage i is 
identical to that observed at stage £', and zero otherwise, and similarly for the relation 
between D[A, Z](£, £') and 1 + C(£, £'). We now use auxiliary integration variables {<pe} 
to linearize the term in the exponent of (|4"5j) which is quadratic in A, and use causality 
of the response function G where appropriate: 



(Q[{A}]) {Alz} 



</<5a 



n 



dA(£)dA(£) affl[Affl-i4«tf)+foy: , G\A,z\(i,t')A(i')} 



X 



=i 



2n 



Q[{A}} 



1=1 



■JSff 



-Yf%=i MD- l [A,Z)) 



u'Ve' 



</<5jv 



[ n dA{£)\ Q[{A}] 

( n s 



A{£) - A e (£) + G[A, Z]{£, t')A{£') - 



>\A,Z} 



Here (. . .){$\a,z} refers to averaging over the zero-average Gaussian fields 4>t with {A, Z}- 
dependent covariance ((fii4>e'){<t>\A,z} = \D[A, Z](£,£'). We conclude from our expression 
for (Q[{^4}]){A|z> that the conditional disorder-averaged probability density P[{A}|{Z}] 
for finding a bid path {A}, given a realization {Z} of the pseudo-history, is given by 



V[{A}\{Z}] 



*/Sn 

n s 

i=i 



A(£) - A e (£) + -fjJ2 G(£, £')W[£, £'; A, Z\A{£') 

Z £'<l 



{4>\A,Z} 

(49) 



with (Q[{A}]) {A \ Z} = J[U e dA(i)]V[{A}\{Z}]Q[{A}]. Causality ensures that the density 
f!49|) is normalized, since both 0£ and G[A,Z] (£,£') involve only entries of the paths 
{A, Z} with times k < £. 
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Having established ()49|1. our equations (|38I39I4(J|) can be simplified considerably. 
We immediately find that C — 0. To simplify comparison with the theory of JU] 
(corresponding to £ = 1), we will make a final change in notation and put 

K(£, £') = -aR(£', £) L(£, £') = --aiZ{£, £') (50) 

This allows us, with p = aN and in anticipation of our expected time scaling Sn = fj/2p 
(known from the analysis in jTUj of the Markovian limit ( = 1), to write the remaining 
equations ()39|40|) in the simple form 

£(?,?') = ^l^({W[t,e-,A,Z]A(t)A(e)) wz} )\ (52) 

We see that R defines a response function associated with external bid perturbation, 
and hence obeys causality: R(£, £') = for £' > £. This, in turn, enables us to simplify 
equations (J4"2*|) for {C, G} and the measure (. . .)* to a form identical to that found in 
jl()j for the Markovian ('fake history') on-line MG: 

C(M') =(o-[q(t),z(t)]a[q(t'),z(t')]), (53) 
(n u n A]) _ gjg) (9[{q,z}] M[{q,z}]) z 

{9[{q ' }lU ~ Rdq} (M[{q,z}]) m (55) 
M[{q,z}} =A)(?(0)) J{dq} e-^ a I dtdt ' *WWW) 

x e iJdtq(t)[jLq(t)-9(t)+afdt>R,(t,t')*lq(t>),z(t')}] 

4-2. Summary and interpretation 

We recognize that (J5HJ) describes the usual effective single-trader equation with 
a retarded self- interaction and zero-average Gaussian noise rj(t) with covariances 

d -q(t) = 6(t) - a fdt' R(t, f) a[q(t')\ + ^ V (t) (57) 



dt Jo 

We have used the fact, as in [TQJ , that the discontinuity of the correlation function 
for equal times, i.e. C(t,t) = 1, will in the continuous time limit be irrelevant. This 
implies that we may carry out the averages over the decision noise and are left only 
with expressions involving o~[q] = fdz P(z)o~[q, z], and that (with the exclusion of t = t', 
where one has C(t,t) = 1) the order parameter equations ()53I54|) simplify to 

C(t,t) = (a[q(t)]a[q(t')]% G(t,t') = J-(a[q(t)]}, (58) 

Our remaining problem is to solve the order parameters {R, £} from (|51I52|) . To 
do so we must select the canonical time scale 5n such that the iV — > oo limit in 
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(|51I52|) is both non-trivial (i.e. <V sufficiently small) and well-defined (i.e. <V not 
too small). For the special value ( = 1 we know JU] that 5^ = rj/2p. Although here 
we have followed a different route towards a continuous time description, we show in 
Appendix A| that indeed 5n = fj/Zp, by working out our present equations in detail for 



the fake history limit ( —>■ 1. Given this canonical time scaling and given the definition 
W[£, £'\ A, Z] = 5^ e A z , A z y we find our equations ()51I52|) taking their final forms: 



S(t,0 hm 1 ((A(i)A(f)5 X{w)M , AZ) 



(59) 

=t/s N ,e=t'/s N 

(60) 

e=t/s N ,i'=t'/s N 



with S/SA e {£) = 5 N - 1 d/dA{i). Here ((. . .)) AiZ refers to an average over the stochastic 
process ()49|) for the overall bids {A} and over the pseudo-history {Z}. The bid evolution 
process can be written in more explicit form as 

A(£) = A e (£) + fa - hj ^ G(£, £')5 X{w)Me ,^ z) A(£') (61) 
z i'<i 

with the zero-average Gaussian random fields {(f)}, characterized by 

{Me)i4\A& = \[1 + C(£,£')} S\ iw)M >,A, Z) ( 62 ) 

Equation (jHTj) is to be interpreted as follows. For every realization {Z} of the fake 
history 'path' one iterates ()61|) to find successive bid values upon generating the zero- 
average Gaussian random variables <pe with statistics f!62D (which depend, in turn, on 
the recent bid realizations). The result is averaged over the fake history paths {Z}. 

Let us now summarize the structure of the present theory describing the MG with 
true market history in the limit — > oo, by indicating the similarities and the differences 
with the previous theory describing the on-line MG without market history: 

similarities between the theory of real and fake history MGs: 

• The MG with real history is described again by the effective single agent 
equation (j57j) . from which the usual order dynamical order parameters {C, G} 
are to be solved self-consistently via 

• The scaling with A^ of the characteristic times in the MG with history is 
identical to that of the MG without history, if we avoid highly biased global 
bid initializations (where the MG with history acts faster by a factor y/~N). 

differences between the theory of real and fake history MGs: 

• Real and fake history MGs differ in the retarded self-interaction kernel R and 
the noise covariance kernel S of the single agent equation. Without history, 
{R, £} were found as explicit functions of {C, G}. With history they are to be 
solved from an effective equation (ffiTj) for the evolving global bid. 
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the effective global bid process: 

• The effective global bid process (J5TJ) is itself independent of the stochastic 
effective single trader process (|57|). The two are linked only via the (time 
dependent) order parameters occurring in their definitions. 

• At each stage in the process (foTj). the bid A(£) is coupled directly only to bids 
in the past at times £' with identical realization of the M-bit history string. In 
addition, only those effective global bid noise variables (pi are correlated which 
correspond to times £ with identical realizations of the M-bit history string. 

The differences between the two 'fake history' definitions (jBE| (i-e. consistent versus 
inconsistent) are seen to be limited to the details of the averaging process (. . -){z}- 

In jAppendix A| we show how one can recover from (|57|filj) the earlier theory of ^Hj 
in the fake history limit £ — > 1. This exercise serves two purposes. Firstly, it confirms 
that the canonical time scale of our process is indeed given by 5n = fj/2p (modulo an 
irrelevant multiplicative constant). More importantly, being the simplest instance of 
our presently studied class of MG models, it provides useful intuition on how we might 
proceed to find solutions of our effective processes ()57J61|) in the general case. 



5. The role of history statistics 

We continue with our analysis of the full MG with history, and next show that all the 
effects induced by having real market history can be concentrated in the statistics of 
the M-bit memory strings A of (j!5j) . More specifically, the core objects in the theory 
will turn out to be the following functions, which measure the joint probability to find 
identical histories in the effective global bid process (JfiTj) at k specified times {£i, . . . , £ k }, 
relative to the probability p~ k for this to happen in the case of randomly drawn fake 
histories and non-identical times: 

k 

A k (£ u . . . , 4) = P k - X E (( II S XM^AZ) >W} ( 63 ) 

A <=i 

We have abbreviated J2\ — S,\e{-i i} M > w ith 2 M = p = aN. For any value of k one 
recovers in the random history limit and for non-identical times lim^! A^(. . .) = 1. For 
k — 1 one has A\(£) = \a a z))){A z } = 1> ^ or an y C- m contrast, for arbitrary 

( (i.e. when allowing for real histories) and k > 1 the functions are nontrivial. 

5.1. Reduction of the kernels {R, £} 



We will follow as much as possible the steps which we took in |Appendix A in order to 



recover the ( = 1 equations (|A.9IA.14j) . We re- write the global bid equation (JoTj) as 
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and we formally invert the operator on the left-hand side, using <V = fj/2p: 

A(£)=A e {£) + <f>e + '£{- 7 ly £ G{t^)G{i u t 2 ) . . .G(4-i,4) 

r>0 1 U...l T 
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X 



7 

[if. 

i=i 



\(e,A,Z),\(£i,A,Z) 



[A e (e r ) + <f>i r ] 



(64) 



Expression is itself not yet a solution of ()61|). since the bids {A(s)} also occur inside 
the history strings , A, Z) at the right-hand side. We now insert flSH) first into (JoT?)). 
and consider only infinitesimal external bid perturbations A e , so that we need not worry 
about indirect effects on A{£) of these perturbations via the history strings A(s, A, Z): 



R{t,t/)=6(t-t')+}im JEC-W 1 E £i)G{£i, £ 2 ) ... G(£ r _x, 

on— »0 



r>0 



r-1 



x P ((^A(£,a,z),A(^,a,z) IT ^A(M,z),A(4, 



i=l 



A ' Z )//{A,Z} 



£=— £'= — 

s N ' s N 



= 6(t-t')+\im E G(4^i)...G(4-i,4) 

[r>0 ^...4.-1 

x A r+1 (4, 

Similarly we can insert ()64|) into (joTIj) . again with A e — > 0, and find 

E(t,0=^lim ^1 £ (-6 N f +r ' G{£ Q ,£ x )...G{£ r _ u £ r ) 



r 

[IF; 



3 A«o,A,X),A(4,A^)J [ II ^A(^ n , A,Z),\(t,A,Z) 

i=l j=l 3 



{A,Z} 



(65) 



hm 2 (-W +r ' E G{e 0l e 1 )...G(£ r . u £ r ) 

*v-0 I ^ ^ 



E G(4^)...G?(Ci,0 + 



t 1 ...t, r 



X A r+r ./_|_2(£o ; • • • > ^r> ^Qj • • • j £' r ' 



(66) 



The limits 5at — > in ()65I66|) are well-defined, since each time summation combines with 
a factor 5^ to generate an integral, whereas pairwise identical times in (J66j) leave a 'bare' 
factor 5n but will also cause A r+r / +2 (. . .) to gain a factor p = fj/28 N in compensation. 

Since the single agent process (joT|) is linked to the global bid process flBTj) only via 
the kernels {R, E}, we conclude from ()65I66|) (which are still fully exact) that the effects 
of having true market history are concentrated solely in the resulting history statistics 
as described by the functions (jHSJ)- More specifically, there is no need for us to solve the 
global bid process (JBTj) beyond knowing the history statistics which it generates. 
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5.2. Time-translation invariant stationary states 

In fully ergodic and time-translation invariant states without anomalous response, we 
could in 'fake history' MG versions find exact closed equations for persistent order 
parameters without having to solve for the kernels {C, G} in full, and locate phase 
transitions exactly. This suggests that the same may be true for MGs with true history. 
Thus we make the standard time-translation invariance (TTI) ansatz for the kernels in 
(|57j) and for the correlation- and response functions: 

C(M') =C{t-t') G(t,t') = G(t-f) 
R(t,t') = R{t-t') E(M') = E(t-t') 

with x = fo°dt R(t) finite. It turns out that several relations between persistent 
observables in TTI stationary states of the present non-Markovian MG process, 
if such states again exist, can be established on the basis of (|57|) alone. Upon 
following established notation conventions and abbreviating time averages as / = 
lim^oo r _1 f^dt f(t), we may write the time average of (JH7|) as 



dq/dt = 9 — ax R cr + y/a rj (67) 

with x R = Io°dt R(t). We may now define the familiar effective agent trajectories 
corresponding to fickle versus frozen agents as those with either dq/dt = or dq/dt ^ 0, 
respectively. For frozen agents, consistency demands that sgnfcf] = sgn[dq/dt]. It then 
follows from (J67j) that the (at least for x R > complementary and mutually exclusive) 
conditions for having a 'fickle' or a 'frozen' solution can be written as follows: 

fickle : \9 + y/afj\ < ax R o~[oo], a = ^~ - (68) 

frozen : \6 + \fatr}\ > a% K cr[oo], a — a[oo].sgn - (69) 

Which solution of (jH^j) and (jnHj) we will find depends on the realization of the noise term 
rj, which is a frozen Gaussian variable with zero expectation value and with variance 

S Q 2 = (rf = lim — J T dtdt' S(t, t') = S(oo) (70) 



We may now proceed as in |Appendix A towards the calculation of the persistent order 



parameters 0, x an d c, where (J) denotes the fraction of frozen agents in the stationary 
state, where x — fo°dt G(t), and with 

c = lim C(t) = lim \ rdtdt' (a[q(t)]a{q(t')]), = (a 2 )* (71) 

Upon introducing the short-hand u = ^fax R o~[oc\/ S \/2, and upon using the conditions 
and relations ()68|69j1 . we find in the limit 9 — » of vanishing external fields: 

( t )= 1 7T^r= e~^ 2/s o9\\fj\ - ^x R (r[oo] 



1 - Erf [m] (72) 
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drj 



+ 9[Vax R o-[oo} - \r}\ 
= a>[oo]{l - ErfH + ±Eri[u] - ^=e- 2 } 



rf 
aX 2 R 



(73) 



S V2tt d(\/av) 
= Eri[u}/a XR (74) 

Hence, in order to find the TTI stationary solution {0, c, x} an d the phase transition 
point (defined by x °°)> we om y need to extract expressions for x R an d So from the 
stochastic overall bid process (jBTJ). Using ()65I66|) . the latter can be written as 

Xr = / dtRit) 

j 

= 1 + Jim { E(-^) r E G ( £ i - ^)G{h -is)... G(4-i - i r )G(£ r ) 

x A r+ i(£i, ... ,£ r ,0) I (75) 

5 2 = lim -L x: ; im n I e (-^) r+r ' e - *o • • ■ G{tr-i - 4) 

L ^ OOL %0^<L <5Ar ~ > ° U'>0 h..2r 

x ^ G(^ Q - £[) . . . G?(Ci - O [1 + C(4 - C)] 

x A r+r , +2 (4,...,4,4--->C)i (76) 



5. 3. TTI states with short history correlation times 

Calculating the history statistics kernels (J55J) from the global bid process (|UT)l is hard, but 
in those cases where the history correlation time (measured in individual iterations 
£) in the process is much smaller than N, we can make progress in our analysis of TTI 
stationary states. We define the asymptotic frequency 7T\ (A, Z) at which history string 
A occurs in a given realization {A, Z} of our process (JtTTf as 

1 L 

n \( A > z ) = i E 6 \Mw) ( 77 ) 

Obviously Y^\^\(A, Z) = 1. For ( = 1 (no history) we would have tt^ = p^ 1 for all 
A. We may also define the distribution g(f) of these asymptotic history frequencies 
it^(A,Z), relative to the benchmark 'no- memory' values p -1 , and averaged over the 
global bid process (j61|) in the infinite system size (i.e. continuous time) limit: 

6(f) = I E (W - P*\(A, Z)\)) {A , Z} (78) 
p A 

Our definitions guarantee that J °°df fo(f) = 1 for any (. For ( = 1 we simply recover 
s(f) — $[f ~ 1]> i- e - all histories occur equally frequently. We have not yet shown that 
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<?(/) 



f 



<?(/) 




Figure 1. Typical examples of history frequency distributions l|78|) as measured in 
simulations of the on-line MG without decision noise but with full history (i.e. £ = 0), 
after equilibration. Here N — 8193. Left: a = 0.125 (in the non-ergodic regime of the 
MG, below a c ). Right: a — 2.0 (in the ergodic regime, above a c ). 



the limit in (|78|) exists, i.e. that the history frequencies do indeed generally scale as 
t^\{A, Z) = 0(N~ V ). Numerical simulations, however, confirm quite convincingly that 
this ansatz is indeed correct (see e.g. Figure Q). 

If Lh is the history correlation time in the process ()61|). then finite samples of history 
occurrence frequencies can be expected to approach the asymptotic value (|77jl as 



1 e+L 

W £ 5 \Mi',Az) = ^ Z ) [l+0{{L h /LY 



(79) 



This implies that in expressions such as f75]). where G(£) — G(£') = 0(\£ — £'\/N) and 
where only time strings 
will survive the limit Sn 



A r +i(£i, 



This results in 



,£k} with mutual temporal separations of order O(N) 
y 0, we may choose e.g. L = \/L h N and effectively replace 

4,o) - /Ekl 1 + °(^)] irTl 



n r+l 



0) 



P ¥-EE(-X) r pK X [l + 0(y/L h /L)] 



1 Wx [l + O^L h /N)\ 

hm - > 



df g(J) - 
o 1 



./ 



1) 



r»°°p*Zl + XFirx [l + (y/L h /N)] J ° + 

(provided indeed limAr^oo Lh/N = 0). The same simplification to an expression involving 
only the distribution g(f) can be achieved in (J7HJ), but there we have to be more careful 
in dealing with the occurrences of similar or identical times in the argument of (|63|) . We 
first rewrite (J76|) by transforming the iteration times according to 



for all i G {0, . . . , r} : 



j=i 
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This gives, using lim^oo G(s) = (i.e. restricting ourselves to ergodic states with 
normal response): 

So = e i-^y +r ' E G( ao ) • • • G( ap _o E g( Sq ) . . . 

^ r,r'>0 a ...*r-l>0 s'.s' >0 

U r'— 1 

x ; im 7^ E E [i + c( s ,-40] 



x A 



r+r'+2 



(s + • • • + S r , 



r — 1 "I - ^t'j 



Each time summation is compensated either by a factor 5^ (giving an integral), or 
limited in range by L and compensated by an associated factor so that any 'pairing' 
where two (or more) times are close to each other (relative to the correlation time L/J 
will not survive the combined limits 5n — > and L — > oo. Thus we may again put 

r+r'+2 



A 



r+r'+2\ 



V 



r+r'+l 



E K\[l + 0(jL h /N)) 



(82) 



and find, with C(oo) 



c: 



<j2 



p7T A [l + 0( V 'WA r )] 



:i+-=)to-E5:(-xr 

^ A r,r'>0 

,, M , y 1^ [ P vr A [l + 0(.yZ^)] 
1 + c) hm - > — ===— 

A [l + X p* X [1 + 0{JlJn)] 



-i r+r'+2 



n 2 



:i + c) 



oo p 

df g(f) t— — 
o (l + X/) 2 



(83) 



Since only x R an d 5*0 are needed to solve our effective single agent process in TTI 
stationary states, we see that upon making the ansatz of short history correlation times 
Lh <C N the effects of history on the persistent order parameters in the MG are fully 
concentrated in the distribution g(f) of history frequencies, as defined by (J7H|) . Once 
g(f) has been extracted from the process (|61|). the TTI order parameters are given by 
the solution of the following set of equations: 

a[oo]y/ax R 1 - 



u 



S V2 



X 



a 00 



Erf [u] + — Erf [u] 



1 - Erf [u] 



r-2 
°0 



oo f 

df Q(f) ~r~~ 7 
1 + Xf 



:i + c) 



df g(f) 



a + xf) 2 



(84) 
(85) 
(86) 
(87) 



For C = 1 (the fake history limit) we have g(f) = S[f — 1] , leading to x R — (1 + x) _1 an d 
5o = vT+~c/(l + x), and the above equations are seen to reduce to the corresponding 
ones in [TU], as they should. 
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6. Calculating the history statistics 



Upon making the ansatz of short history correlation times in the MG, we have 
shown that finding closed equations for persistent TTI order parameters boils down to 
calculating the distribution g(f) of relative history frequencies, as defined in (JZHJH- Our 
remaining programme of analysis is: (i) finding an expression for g(f), (ii) expressing 
this distribution in terms of the persistent order parameters {c, 0, x, X R i ^o}, an d (in) 
confirming retrospectively the consistency of assuming short history correlation times. 



6.1. The moments of g(f) 

The distribution (|7Hj) is generated by the non-Markovian process (16 1)1 . which we cannot 
hope to solve directly. However, we can get away with a self-consistent calculation which 
does not require solving (pjTj) in full. We focus on the moments of the distribution g, 
from which the latter can always be recovered (if the integrals below exist): 

POO 

A** = / df g(f)f k (88) 
Jo 

Obviously //o = l^x — 1, for any (, which follows directly from definition (|75jl . In the 
absence of history (i.e. ( = 1) we have g(f) = 5[f — 1], so that fM k — 1 for all k > 0. 
We will rely on the sum over moments in ()89|) converging on scales of k which are 
independent of N . This is equivalent to saying that the limit (|75|) is well-defined, so it 
does not restrict us further. By combining the definitions 1)77178188)) and (|15p. we can 
obtain a more explicit but still relatively simple expression for the moments 

0* = ~ E ((b™\( A , z )f)){A,z} 
p x 

pk L M | fc 

= 2™7i E ((II {5 E n*ww})) M 

£i...4=l i=l z A=±li=l 1 J 

fc— 1 Z/ AZ /c 

= 2™ V, ^({njn/iA^A^+n^-iA^^)}))^/^) 

The average ((. . )){a,z} in the last line of d§Uj) equals the following joint probability: 



((. . .)) {A;Z} = Prob 



{A 1 (£ 1 ,A,Z) = A 1 (£ 2 ,A^)= ••• =Ai(4,A^)} 
and{A 2 (£ 1 ,A,Z) = A 2 (£ 2 ,A,Z)= ... =A 2 (4,A^)} 



and{A M (4,A^) = A M (4,A^)= ••• = A M (4,A^)} 



(91) 



|| A similar conclusion was reached also in |15|. but on the basis of several approximations. Furthermore, 
in contrast to the present GFA approach, in ^H] there was no way to calculate g(f) from the theory. 
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Let us define the short-hand 

Same(i) = {X^, A, Z) = \(£ 2 , A, Z) = . . . = A* (4, A, Z)} (92) 

which states that the z-th component of the history string takes the same value at the 
k specified times {£i, . . . ,£k}- Given that our bid process obeys causality^, statement 
(|9~T|) can be written as 

((. . .)) {AZ } = Prob[ Same(l) A Same(2) A ... A Same(M) 

= Prob [Same (1) | Same(2) A ... A Same(M)} 
x Prob [Same(2) \ Same(3) A ... A Same(M)} 



x Prob [Same (M-l) | Same(M)} 

x Prob [Same(M)} (93) 

Since we need not consider values of k which scale with iV or L, the contributions to 
()90|) from those times {£i, . . . ,4} for which there are correlations between objects at 
a time i r and those at another time l r i will vanish in the limit L —>■ oo. Since we also 
know that we are in a TTI state, it follows that the conditional probabilities in (}9~3*j) will 
not depend on the actual values {£±, . . . ,£k}- In the limit L — ► oo we may replace 

Prob[SWie(r) | Same{r + 1) A ... A Same(M)] — > V[k\M-r] 

where V\k\ m ] denotes the probability to find for randomly drawn and infinitely separated 
times {£i, . . . , £k\ that Aj(£i, A, Z) — . . . — Xi(£k, A, Z), for an index i, given that the 
identity holds for the indices {i + 1, . . . , i + m} (with V[k\o] giving this probability in the 
absence of conditions). This allows us to write (J9*U|) as 

t^k=p k 1 V[k\M-l]-'P[k\M~2] ■■■ 'P[k\l]-'P[k\0] (94) 

As a simple test one may verify (J9"2|) for the trivial case £ = 1 (fake history only). Here 
conditioning on the past is irrelevant, so V[k\ m \ = ^[fc|o] — 2 1_fc for all m, which indeed 
gives us /ifc = p fe - 1 2( 1_fc ) M = 1 (as it should). In the continuous time limit — » oo 
(equivalently: for M — > oo, since 2 M = aN) we thus find the as yet exact formula 



M-\ 

lim log(/i fc ) = lim log |2 fc-1 ^|r] I (95) 

M— >oo M— >oo — " L 

r=0 



6.2. Reduction to history coincidence statistics 

Next we have to find an expression for the probabilities V\k\r\- We know from (jfilJ62j) 
that the value of the overall bid at any time £ is only correlated with the bid value at time 
£' if the two times (£,£') have identical history strings, i.e. if X(£,A,Z) = X(£',A, Z). 

% We here use the fact that a component Xi(£,A, Z) of the history string observed by the agents at 
time I is by construction (see definition l|15f) ) referring to the overall bid at time £ — i. It follows that 
the probability of finding a given value for Xi(£, A, Z) depends via causality only on the bids at the 
earlier times {£ — i — 1,1 — i — 2, . . .}, hence on {Ai+i(£, A, Z), Ai + 2(£, A, Z), . . .}. 
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We know that individual histories show up during the process with probabilities of order 
N~ x . Since the likelihood of finding recurring histories during any number r = O(M) 
of consecutive iterations of our process is thus vanishingly small (of order 0(M/N)) 
such direct correlations are of no consequence in our calculation. The only relevant 
effect of conditioning in the sense of the V[k\r] is via its biasing of histories in subsequent 
iterations. Although the probability of history recurrence during a time window of 
size O(M) is vanishingly small, if two (short) instances of global bid trajectories are 
found to have identical realizations of some of the bits of their history strings, they will 
nevertheless be more likely than average to have an identical history realization in the 
next time step. This is the subtle statistical effect which, together with the resulting 
biases in the bids which are subsequently found at times with specific histories, gives 
rise to the relative history frequency distributions g(f) as observed in e.g. Fig. [T] 

The statement that the conditioning in V\k\ r ] acts only via the joint likelihood of 
finding specific histories {Ai, . . . , A*.} at the k specified (and widely separated) times 
{£x, ■ . ■ ,£k}, translates into 

V[k\r]= E V[k\\ l ,...,X k ] P[X 1 ,...,X k \r] 
X 1 ,...X k 

Here V[k\ Ai, . . . , X k ] denotes the likelihood to find \{£\, A, Z) = 
the history strings at those k times equal {Ai, . . . , A&}, and "P[Ai, . 
likelihood of finding those k specific histories given that the bits of the k history strings 
have been identical over the r most recent iterations" 1- . The probability of finding specific 
bid values A(£) will in TTI states only depend on the history string A associated with 
time t. Given this history string, A(£) is a Gaussian variable (this follows from the 
effective bid process (16 lj) ). with some average A x and a variance cr 2 ^ (which will in due 
course have to be calculated). Using also the fact that the Z(£,i) were defined as zero 
average Gaussian variables, with variance k 2 , we obtain: 

k 

V[k\X x , . . . , X k ] = f[ [ [DZ [dA P X (A)6[(1 - QA + (Z\ 



(96) 

= X(£ k ,A,Z), if 
AJr] denotes the 



3=1 



n 



+ U[jDZ JdAP Xj (A)9[-(l-0A 
(1 - C)Av 



CZ] 



- + -Erf 
2 2 



x/2 + a - o H 



k 

+ n 

3=1 



1 - i E rf 

2 2 



(97) 



■^C 2 « 2 + (i-C)H 

We now write the sum over all combinations of histories in (|9Tj) in terms of a partitioning 

+ Here one will find that consistent and inconsistent realizations of the history noise variables Z(£,i) 
are to be treated differently: in the case of consistent noise, one will always have Xi(£,A,Z) = 
Xi+i(£ + I, A, Z). This is not true for inconsistent history noise. 
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in groups, where two M-bit strings {A,, A.,} are in the same group if and only if they 
are identical. We write (gi, g 2 , ■ ■ .) for the subset of all combinations {Ai, . . . , A^} with 
one group of size gi, a second group of size g 2 , and so on*. Clearly g\ + g 2 + . . . = k, 
for all possible subsets of our partitioning. This allows us to write 



V[k[ 



E S k,g 1 +g 2 +...'P[k\gi, 92, 
(91,92, ■■■) 

According to (|H7jl . the distribution V[k\gi,g 2 , 
7>[%i, £ 2 , . . .] = 2 1 - fc $(£ 1 , 3 2 , . . .), with 



V[gi,g 2 , 



(9* 



$(01) 02, 



1 j>i I A 



7M 



Z 3>1 



1 + Erf 



1-Erf 



is of the relatively simple form 

a - Q2 A 



V^/c 2 * 2 + (1 - C) 2 ^ 
(! " O^A 



9j ' 



v^/c 2 * 2 + (i - C) 2 ^ 



9j ' 



(99) 



Insertion of the representation (|9*H|) for "P[fc| r ] into (JHH|) allows us to write the moments 
of the relative history frequencies in the following form: 

M-l 



Km log(// fc ) 

M^OO 



lim log 



r=0 



M9i,92, ■ ■ ■)'P[9i,92, 



(100) 



E ^,91+92- 

_(Sl,92,-) 

It will be helpful to assess which values of r in (jlOO)) can survive the limit M — > 00. 
Whenever we have a value r such that M — r — > 00 as M — > 00, the condition that the 
history bits were identical over the most recent r steps still leaves a large 0(2 M_r ) 
number of compatible history strings to be found at the probing times {£\, . . . ,£k}, so 
the likelihood of finding histories coinciding in multiples (gi,g 2 , ■ ■ .) scales as 



V[l,l, 



1 + 0(2 



r-M\ 



V[ gi ,92,...\r]= J] 0{2^ 

il9j>i 

Since k is finite and $(1, 1, 1, . . .) = 1, the total contribution to log(//j.) from those terms 
where M — r — > 00 as M - » 00 is negligible, since for 1 < -C M we may write 

R-l 

E lo §[ E S k,g 1 +g 2 +...^(9i,92,---)V[g 1 ,g2,...\r] 



r=0 



(91,92,-) 



E log 1 + °( 2 



r-M\ 



r=0 



0(2 



(101) 



Hence in (llOOj) we need only those terms where M — r is finite. These terms represent 
contributions where virtually all past components of the history strings at the times 
{£1, . . . ,£k} were identical, which should indeed constrain the possible histories at the 
times {£\, ...,£}.} most, and indeed gives the largest history coincidence rates. We 
consequently switch our conditioning label from the number r of previously identical 
components to the number M — r of unconstrained components, and write 

V[ gi ,g 2 ,...\r] = Q[ gi ,g 2 ,...\M-r] 

* For example: (fc) denotes the subset of all combinations {Ai,...,Afc} where Ai = ... = A^, 
(2,1,1,...) is the subset of all {Ai, ...,Afe} where precisely two history strings are identical, and 
all others are distinct. 
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and find (|1(J(J|) converting into the simpler form 

^lin^ log(/i fc ) = ^ log [ S k, gi +g 2 +...^(9i,92,---)Q[9u92,---\r]\ (102) 

r>l (91,92, •■■) 

We are left with the task to calculate the likelihood Q[gi,g 2 , • • • l r ] of finding at the k 
distinct times {£i, . . . ,£).} of our process the histories {Ai, . . . , A fe } to be identical in 
prescribed multiples of (gx, g 2 , ■ ■ .), given that the bits of the k history vectors were 
identical during all but r of the most recent iterations. 

At this stage we benefit from having to consider only values of r in (jlU2j) which are 
finite (compared to M, which is sent to infinity). For each value r of the number of 'free' 
components, there will be only 2 r possible history strings A available to be allocated to 
the k times {£i, . . . ,£k}- In principle one would have to worry about the probabilities 
to be assigned to each of the 2 r options. However, we know for the full M-component 
history strings that their probabilities scale as 7r^ = f\p~ l with f\ = 0(1), so the 
effective probabilities of individual components of A G { — 1,1} must scale as 

^ = O^f 1 ) = 0{\f^ M ) = \[1 + OiM- 1 )] 

From this we deduce that for finite r we may take all 2 r allowed history strings to 
have equal probabilities. This turns the evaluation of Q\g\, g 2 , . . .\r] into a solvable 
combinatorial problem. Each of k elements is given randomly one of 2 r colours (where 
each colour has probability 2~ r ), and Q[gi,g2, ■ ■ ■ \r] represents the likelihood of finding 
identical colour sets of sizes (gi,g2, • • •)• Let us abbreviate R = 2 r , and write the r-th 
term in (jlU2j) as \og(H r ). Now, using 2~ r<y9l+ '" 9R ^ = 2~ rk = R~ k we may simply writejj 

lim log(^) =5>gtf r (103) 

r>± 



r 



Hr= $ (^1^2,...)2[^1,^2,... 

(91,92,-) 

k k—gi k—gi—g2 fe— ffl— ffJi-l 

= E E E ••• E *(</i, <&,■■■) 

9i=0 92=0 93=0 g R =0 

k - gi \ ( k-g x -g 2 

92 ) V #3 

6.3. Expansion of sign- coincidence probabilities 





(104) 



Having simplified the conditional distribution Q[g%, g 2 , ■ ■ ■ \r] of history coincidences, we 
turn to $(51, g 2 , ■ ■ ■) as given by If we restrict ourselves to an expansion of in 
powers of the (random) bid biases in which we retain only the leading terms, our 

jj One easily confirms that our expression for H r is properly normalized. Upon choosing ^(.gi, gi, • • •) = 
1 one can perform the summations iteratively, starting from gn and descending down to g\ , which leads 
exactly to the factor R k to combine with the R~ k present. 
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problem simplifies further to the point where we can obtain a fully explicit expression 
for the moments fik- In Appendix B we derive the following compact relations: 



n 



E 



7P 



Erf 



(105) 
(106) 

(107) 



The results ()105I106|) imply that, rather than knowing the full probability distribution 
V[gi,g2, ■ ■ ■ \r] in ()100j) . we only need the (conditional) statistics of a modest number 
of relatively simple monomials. Expanding the exponential in ()106|) up to the relevant 
orders, and using J2j9j — k (which is always true inside (|104j0 produces 



Hz- 



~ 3>1 



k 



(108) 



4 ^2 .4^ 



, ^ 9h) - 2 E 9) - (k - 5) £ g) + ^- 3fcj + 0{<$) 

ij>l 



i>i i>i 

Since the combinatorial averaging process of ()104|) in this particular representation 
involves a measure which is invariant under permutations of the numbers {gi,g2, ■ ■ ■}, 
the average of ()108|) is identical to that of the following simpler function (with R = 2 r ): 



$eff(<?i,<? 2 ,---) = l + -n(Rgl-k) 
+ Ifl 2 \Rg{ + R(R - \)g\g\ - ARg\ - 2{k - 5)Rgf + k 2 - 6k 



(109) 



Instead of having to use full combinatorial measure ()104j) , we can therefore extract all 
the relevant information from the (joint) marginal distribution for the pair ((71,(72) only. 
Inserting (jl09|) into ()104|) gives us 



-n 



RG k 2 f - k 



+ -Q 2 



RG^q + R(R — l)G 2 ' 



k,R 
2 



4RG h 3 f - 2(k - h)RG K 2 f + k' 2 - 6k + C(fi 3 ) (110) 
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9r 
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(R-2) k -^glg b 2 



[III) 



Those combinatorial factors G^f which we need in order to evaluate ()110|) are calculated 
Appendix C| They are found to be 

1) 
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G 



k,R 
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k k{k- 
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Figure 2. Test of the predicted history frequency distributions (left picture, 

based on expansion of the moments /ifc up to first order in the width, /i^ = 
e -jfifc(fc-i)^ anc j (right picture, based on expansion up to second order, = 

e 2f2fe(fe-i)- T 2f2 fc(fc-i)(2fc-3)^ together with the data of Fig. ^ as measured in 
simulations for a — 2.0 and N — 8193. In both cases the second moment which 
parametrizes (I116f) and (|118|l was taken from the data: ^2 ~ 1.380. 



KR _ k 3fc(fc-l) , k(k-l)(k 

^3,0 — — H ~ b 



R R? i? 3 
k>R _ k_ 7fc(fc-l) 6fc(fc-l)(fc-2) fc(fc-l)(fc-2)(fc-3) 

4,0 i? 2 + i? 3 + i? 4 

fejR = fc(fc-l) 2fc(fc-l)(fc-2) fc(fc-l)(fc-2)(fc-3) 
2,2 i? 2 i? 3 i? 4 

Insertion of these factors into (jllOJI . followed by restoration of the short-hand R = 2 r , 
gives us the fully explicit expression 

H r = l + -Qk(k - l)2~ r + -Q 2 k(k - l)(k - 2)(k - 3)4" r + C(fi 3 ) (112) 
2 8 

We can now write explicit formulae for the moments of the relative history frequencies, 
and hence also for the distribution g(f) itself, in the form an expansion in a parameter 
fl which controls the width of this distribution. 

6.4- Resulting prediction for g(f) 



The result ()112j) . together with the earlier relation ()103j) and the geometric series leads 
us finally to the desired expression for the moments 

Tim log(fi fe ) = ~ttk(k - 1) - —£l 2 k{k - l){2k - 3) + 0(Q 3 ) (113) 
We see that this general formula obeys /i = /J>i = 1, as it should, and that 

hm n 2 = eMn'+'W (114) 

M— *oo 

Insertion into our earlier expression (J85j) for g(f) leads in the limit M — > 00 to a formula 
in which, at least up the relevant orders in Q, the insertion of a Gaussian integral allows 
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us to carry out the summation over moments explicitly: 



g(f) 



Z/l fc>0 



-iu 



k\ 
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Q 2 ) 



(115) 



We may use (J 114)) to express Q in terms of /z 2 , turning our expansion of the moments 
/ifc into an expansion in powers of log(/i 2 ). Depending on whether we wish to take our 
expansion only to order 0(log(/z 2 )), or also to 0(log 2 (/i 2 )), we obtain 

p-^ 2 (/) 

(116) 



to C?(log(/x 2 )) 



*(/) 



/^/27Tlog( yU2 ) 

log(/) + |log(^ 2 ) 



-§* 2 (/) 



to 0(log 2 (^ 2 )) : g(f) 



*(/) 



l + U\og^ 2 )(3z(f)-z 3 (f)) 



/ v /27r[log( / u 2 )+log 2 ( / i 2 )] 
log(/) + |[log(/i 2 ) + |log 2 (/i 2 )] 
log(/ia) +log 2 (/x 2 ) 



(117) 



'1181 



(119) 



The two statements ()116|) and ()118|) are indeed found to constitute increasingly accurate 
predictions for the actual distribution of the relative history frequencies, see e.g. Fig. 
121 We have thus been able to explain the origin and the characteristics of the observed 
history frequency statistics. However, both formulae are expansions for small Q. Should 
(1118)) be applied to values of Q which are not small, one has to be careful in dealing with 
large values of /, where g(f) could become negative (this would have been prevented 
by the higher orders in Q). The implication is that in the Gaussian integral ()115)) one 
must in practice either introduce a cut-off z c = 0(VL~ l l & ), or exponentiate the factor 

[i + yio g w(3,(/)-z 3 (/))]. 



6.5. The width of g(f) 

What remains in order to round off our analysis of the distribution of relative history 
frequencies is to calculate the width parameter Q in (jllHJ) self-consistently from our 
equations. According to our theory, Q is given by ([107)1 . i.e. by 
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E^A 
A 



Erf 



A/c 2 ^ 2 + (i 
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(120) 



A 2 ^ — describe the statistics of those bids which 



The quantities and a 2 ^ 
correspond to times with a prescribed history string A. We know from (|64|) that these 
are Gaussian variables, which implies that A^ and cx 2 ^ are all we need to know. Since we 
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restrict ourselves to non-anomalous TTI states, we can write both as long-time averages: 

=^\}^ L ' l i:^\Mw ) m (121) 



e=i 

L 



(122) 



e=i 



We can work out the average A x , using and time-translation invariance, and 
subsequently define the new time variables Sj = l{ — £{ + i (for i < r) and s r = £ r 



(so that 



Sj + Sj + i + . . . + s r ). This results in 
1 



A x = lim -—J2Y,(- 5 nY E G{£ -£ 1 )...G{£ r _ 1 -£ r ) 
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(123) 



SQ...S r _l 
J L ~J2i<r Si r 

s r =0 1=0 

Given our ansatz of short history correlation times, in the sense of ([79)1 . and given 
X = E«>o^(^) < 00 ( so m ust decay sufficiently fast), we find this expression 

simplifying to 



^A = E(-XP^A) r — T E ^A,A( S , 
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r>0 

In a similar manner we find 



(124) 
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(125) 
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Again we use J2t G(£) < oo to justify that in the summations over s r and s' r , the upper 
limit can safely be replaced by L. Thus: 
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A 2 - 
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(126) 



The present calculation is similar to that of the volatility matrix in the fake history online 
MG [10J (the quantity a\ = A 2 X -A\ can be regarded conditional volatility, where 
the condition is that in collecting our statistics we are to restrict ourselves to those times 
where the observed history strings take the value A), so also here we have to worry about 
pairwise time coincidences. Each such coincidence effectively removes one constraint of 
the type 5 X A 



(...,A,zy 



since the latter will be met automatically. The remaining terms 
will occur in extensive summations, so that we may replace each 'unpaired' occurrence 
of a factor 5^ ^ Az y except for those with the same argument as one of the Gaussian 
variables 0, by its time average 7r^. In practice this implies the replacement 
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(127) 
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(128) 



As in the calculation of the volatility in jlOj . lacking as yet a method to deal with all 
the complicated terms generated by the factor proportional to the learning rate fj, we 
have to restrict ourselves in practice to approximations. As in jTU] we first remove the 
most tricky terms by putting fj — > 0. This gives 

^A = E (-^A) r+r ' E G(s ) . • • £ G(4) . . . G«'-i) 



r,r'>0 



S ...S r -l 



x E<Vr / 11111 ttE 5 

k \ s=U 



A,A(s,A,z) (3 A,A(s+fc,A,z)^ s ^ s + fc 



(129) 
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We then assume that the limit L — > oo in the last line converts the associated sample 
average into a full average over the statistics of the Gaussian fields <f>e given by ()62|). i.e. 

1 L 



lim -g-jr 



E 5 . 



XMsA^XMs+w^^+k)^ = - [1 + 



Separating the correlation function into a persistent and a non-persistent term, C(fc) = 
c + C(k), and returning to the earlier notation with time differences inside the kernels 
G, results in the history-conditioned equivalent of the volatility approximation in jlOj : 

^A = E(-<W^A) r+r ' E G(s )...G(s r ^) £ . . . C(4,_ x ) 



r,r'>0 
1 



s o-V-i 



X 



i+ c +c(5>i-£*;-: 



l + c 



1 



2(] . xl>Kx) 2 + 2j dsds ' (i+p*x G r c ( s +^A Gr )~ 1 ( 13 °) 

where y) = <5(x — y). In order to get to the present stage we have averaged the in- 
dependent terms inside A 2 A over the Gaussian measure (. . -)^\a,z- Consistency demands 
that in working out a\ = A 2 ^ — A\ we do the same with the term A\, where is 
given by ()124|) . so our approximation for the history-conditioned volatility becomes 



A 2 - 



(fx) 



1 



-V 1 J V x x l + C(£- £') 

X (Lvr A ) 2 £± x °XMW)°X,X(1',AZ) 2(1 + xp7lx )2 



dsds' (l + pvr A G)- 1 (s)C(s -s')(l + P 7T X G)- l (s" 



(131) 



Our final step again follows [TO]. We assume that the non-persistent correlations C(t) 
decay vary fast, away from the value C(0) — 1 — c, so that in the expansion of ()131|) in 
powers of G we retain only the zero-th term: 



5 (l-c) 



(132) 



We may now return to expression 



£1 = lim Y irx Erf 2 



and insert our approximations 
(1 " C)0A 



and 



with 



« f?(/,0)/Erf 



v^a + xp*\) \je* 2 + |(i - o 2 (i - c) 

(1-00 



v^(i + x/)VC 2 « 2 + |(i-C) 2 (i-c) 



(133) 



(134) 
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We know the 0^ to be Gaussian variables, with (4>\) = and (4>\) = |(1 + c) (see the 
above derivation of a 2 ^ where this was shown and used). Hence, upon making our final 
simplifying assumption that in the relevant orders of our calculation the correlations 
between the history frequencies ir^ and the Gaussian fields 0^ are irrelevant, we obtain 

-0»/(l+c) 

g(f,4>) = g(f) , , ■ (135) 



7T 1 + C 



and hence ()133|1 simplifies to 



n 



df Q(f) f DxEri 



2(i + x/)VC 2 « 2 + K 1 -0 2 (i-c) 



(136) 



Using the integral JDx Erf (Ax) = - arctan [yl + A A 2 ] — 1, in combination with the 
identity fdf g(f)f 

df o(f)f arctan 



n 



4 

7T JO 



1, our approximate expression for the parameter Q thus becomes 

(l-C) 2 (l + c) 



1 + 



- 1 



(137) 



:i+x/) 2 [c 2 ^ 2 + |(i-o 2 (i-c) 

In the limit of strictly fake history we recover from ()137|) the value lim^i Q = 
(4/7r) arctan [1] — 1 = 0, as it should. For MGs with strictly true market history, on 
the other hand, expression (|137|) simplifies to 



4 roo 

limf2 = — df g(f)f arctan 

7T JO 



2(1 + c) 



(1 + X /)2(1-C) 



- 1 



(138) 



In accordance with earlier observations in simulations ^2] we also see that, as the system 
approaches the phase transition when a is lowered from within the ergodic regime, the 
increase of the susceptibility x automatically reduces the width parameter Q, until it 
vanishes completely at the critical point. 



7. Closed theory for persistent observables in the ergodic regime 

We have now obtained a closed theory for the time-translation invariant states of our 
MG, albeit in approximation. It consists of the equations (|84|85|8fi|87j) for the persistent 
order parameters, combined with expressions ()116I118|) for the shape and (|114I137|) 
for the width of the relative history frequency distribution g(f). This theory predicts 
correctly (i) that the phase transition point a c (T) of the MG with history is identical to 
that of the model with fake memory, (ii) that at the transition point the relative history 
frequency distribution reduces to g(f) = 8[f — 1] (with at that point also the order 
parameters all becoming independent of whether we have true or fake history), and (iii) 
the shape of the relative history frequency distribution. In the limit a — * oo the theory 
also reproduces the correct order parameter values x = <P = c = 0> f° r an y value of £■ 
For C = (strictly true memory) it predicts lim^oo ft = | and hence lim^oo fi 2 ~ 1.37. 
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Figure 3. Left: the predicted persistent correlations c together with simulation data 
in the non-ergodic regime, for the on-line MG with strictly true history (i.e. £ = 0; 
the solid line gives the theoretical prediction, full circles the experimental data) and 
for the on-line MG with strictly fake memory (i.e. £ = 1; the dashed line gives the 
theoretical prediction, open circles the experimental data) . In both cases decision noise 
was absent. Right: the corresponding predicted fraction <f> of frozen agents, under the 
same experimental conditions and with the same meaning of lines and markers. 
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Figure 4. The moments fi2 — fdf g(f).f 2 and /13 = fdf g(f)f 3 of the distribution of 
relative history frequencies for the MG with strictly true history and absent decision 
noise (i.e. ( = T = 0), as predicted by the theory (solid and dashed lines), compared 
to the moments as measured in numerical simulations (markers, with circles indicating 
fi2 and squares indicating fi 3 ). Note that ^0 = A*i = 1 (by definition). 



Let us finally reduce our closed equations to a more compact form, for the simplest 
nontrivial case of the MG with strictly true market history (i.e. ( = 0) and without 
decision noise (i.e. a[oo] = 1). Here we have 

« X= 1 —^ 0=l-ErfM (139) 

o V^ a X R 
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1 - Erf M + — Erf [u] 



Xr 



r.2 
°0 



n 



Dz 



6 



Vtt(3z - z 3 ) 



U\/TT 

z^/n+^v+^n+^n 2 ) 



+ X 



1 + c) I Dz 
1 



1 + -Vtt(3z - z 2 
6 



X 



(140) 

(141) 
(142) 



Dz 



i + -Vn(3z - z 3 ) 



6 



arctan 



7T 



1 + 



2(1 + c) 



:i-ci 



1 + xe 



(n+in2) 



1 (143) 



Upon using ()14()j) to write c as a function of u, i.e. c = c(w) with c{u) denoting the 
right-hand side of ()14(j|) . and upon eliminating the quantities <f> and So, we find ourselves 
with a closed set of equations for the trio {u, x, Q}'- 



Erf[d 



u 



X 



n 



W2a(l + c 




i + ±Vn(3z - z 3 ) 



^ 2 + i(0+iQ 2 ) 




1 + ±Vtt(3z 



,^+1^+1(0+1^) + 



1 + -^(3^ - z 3 ) 
o 



arctan 



7T 



1 



e 2 v /f1+|Q2_l (n+ l n 2 ) 

2[l + c(t*)] 



(144) 



(145) 



(146) 



1 



[1 _ c ( w )] [l + xe z^W-W+W) 

Solving these three coupled equations numerically, followed by comparison with 
simulation data, shows a surprising level of agreement, in spite of the expansions 
and assumptions which have been used to derive ()144I145I146|) . Figure El shows the 
performance of the theory in describing the on-line MG with strictly true market history 
(i.e. ( = 0), together with similar data for the on-line fake history MG (i.e. £ = 1), for 
comparisonff. In all these simulations iV = 8193. Calculation of the first two non-trivial 
moments of the distribution of relative history frequencies, see e.g. figure 0] (where 
in the simulations a>N 2 = 2 28 ), shows that for small values of the width of g(f) (i.e. /i 2 
close to one, which is true close to and below the critical point) the predictions of the 
theory are excellent, but that the performance of equations ()144I145I146|) deteriorates 
for larger values of \xi- This is obvious, since these equations result effectively from an 
expansion for small values of — 1 . Taking this expansion to higher orders should lead 
to systematic improvement, but will be non-trivial. 

ff Below the critical point, where \ — 00 throughout, equation (|137f) predicts that O = 0. This implies 
that g(f) = 6[f — 1] for a < a c (T), and that below the critical point the differences between true 
and fake history (if any) are confined to dynamical phenomena or to states without time-translation 
invariance. This confirms earlier observations in numerical simulations |15|. where it was found that 
the persistent order parameters in MGs with and without history were identical in the low a regime. 
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8. Discussion 

We have developed a mathematical procedure for the derivation of exact dynamical 
solutions for Minority Games with real market histories, using the generating functional 
analysis techniques of [I7| . So far these techniques had only been developed for (and 
applied successfully to) the less realistic but mathematically simpler MG versions with 
fake market histories, restricting theoretical progress to those particular game versions 
only. We have shown how the technical difficulties associated with the non-Markovian 
character of the microscopic laws induced by having real histories can be dealt with, 
and found (in the infinite system size limit) exact and closed macroscopic laws from 
which to solve the canonical dynamic order parameters for the standard (on-line) MG 
with true market history Here these laws turn out to be formulated in terms of two 
effective equations (rather than a single equation, as for models with fake histories): 
one for an effective agent, and one for an effective overall market bid. In the second 
part of this paper we have constructed solutions for these effective equations, focusing 
mostly on the usual persistent observables of the MG in time-translation invariant states 
(persistent correlations and the fraction of frozen agents) and on the calculation from 
first principles of the distribution of history frequencies. These objects are calculated in 
the form of an expansion in powers of the width of the history frequency distribution, 
of which the first few terms are derived in explicit form. The final theory was shown to 
give accurate predictions for the persistent observables and for the shape of the history 
frequency distribution. It gives precise predictions for the width in the region where 
this width remains relatively small (which is inevitable in view of the expansion used). 
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Appendix A. Recovering the fake history limit 



It helpful for our understanding of the N — > oo limit in (|59|fi()|) to first return to the 
simplest case where we know what the outcome should be, being £ = 1, i.e. fake history 
strings of the inconsistent type (jlj). This is the model which was solved in ^Qj- In doing 
so we en passant re-confirm the correctness of the assumed scaling 5n = fj/2p. 

For ( = 1 we see in (JT5|l and (fT7jl that both A(. . .) and W[. . .] lose their dependence 
on the path {A}, and reduce to 

/ sgn[Z(£,l)] \ _ 
V sgn[Z(£,M)] J 

The role of the Gaussian variables {Z} has thereby been reduced to determining the 
statistics of the symmetric random matrix B with entries Bw = W[£ } £'; Z\. 

iog 2 (p) 



V[B\ = (T[6[Bu> - LI e[Z(£,X)Z(£',X)} 



(A.2) 



A=l 



{^} 



with p = 2 M = aN. The two relevant properties of these matrices are relatively easily 
derived, and are found to be the following. For any cyclic combination of r-th moments 
(with r > 0), where Si > S2 > ■ ■ ■ > s r and r > 1 (no summations) one has 

(B SlS2 B S2iS3 ...B SriSl ) B = p l - r (A.3) 

The second type of average one needs involves two time-ordered strings of matrix 
elements (of lengths r and r', respectively) connected by two further matrix elements, 
where so > si > . . . > s r and s' > s[> . . . > s' r ,\ 



{[B S0Sl B SlS2 . . . B Sr _ lSr ]B SrS ' r \B s ' oS iJ3 s ' iS ' 2 . . . B s > r/ iS >JB SoS > o ) 

Er = oE,=o'W.- r - r '- 1 



p 



(A.4) 



The partial decoupling of the paths {^4} and {Z} implies that our expressions for the 
kernels R and E simplify to 



£=t/8 N ,e'=t'/5 N 

e=t/s N ,e'=t'/s N 



(A.5) 
(A.6) 



E(f,f) =fj hm - {Bur (A(£)A(£')) {Am/ 

Since the bid evolution process is now linear in {A}, and involves only {A}- 
independent zero-average Gaussian fields (pi, where ((pe4>i') {^b} = \Bw\V + C (£,£')], 
it is easily solved for any given realization of the random matrix B: 



A{i) = A e {£) + ^ + E(-?) r E[( GB ) r ]« IMk) + <f>k) 



(A.7) 



in which GB denotes the matrix with entries [GB)i£ = G(£,£')Bu' (i.e. involving 
component multiplication rather than matrix multiplication). To make a comparison 
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with the results of [10J we must remove the external bid perturbations A e (£) after they 
have served to generate the response function R. 

We can evaluate (|A.5|) using only expression (|A.7|) . the causality of the response 
function, and formula (|A.3|) . These give, with 8/8A e (£) = 8^~ x d / f dA(£): 

= ^%a£w)-tf dB m B "' {mhm 

= " m „f E("?r fdB V[B] B«-[(GB)'] a , 
o d N r > 2 J 

= lim ^-\dw-8 N G(£,£') 

+ E(-<V) r E G(£,s 2 )G(s 2 ,s 3 )...G(s r ,£')\ (A.8) 

r>l s 2 >s 3 >...>s r ) 

We observe in (jA.8|) . in view of Su' 6jfS(t — f) in the limit N — > oo, that the canonical 
scaling of time (modulo 0(1) factors) is indeed 5n = fj/2p. We then find exactly the 
expression in JU] for the on-line 'fake history' MG: 

R(t, t') = 5(t - t') + ]T(-l) r G r (t, t') = [1 + G]-\t, t') (A.9) 

Had we chosen an alternative scaling with N of 5n, we would have found either the 
trivial result R = 0, or an ill-defined expression. 

Next we turn to expression ()60|1 for the effective agent's noise covariances, with 
A e = 0. The equivalence of the present expression and that in will be more 
transparent upon renaming (£, £') — > (s , s' ) and D(k, k') — 1 + C(/c, fc'): 

E(5 ,4) = lim n f /dBP[B] B sos , (A(s )A(s' )) {m 

= lim 77j- E (-?) r+r ' E E ^,V) 

<W-*o 2d N r r ,> Q 2 s1 ___ St ^ ^ 

x G(s , s x ) ... G(s r -i, s r ) G(s' , s[) . . . G{s' r ,_ 1 , s' r ,) 

x ((B S0S1 . . . B Sr _ lSr )B s<i (B s[A . . . B s , r _ iS ,)B S0S ,) B (A.10) 

with the proviso that when r = we must interpret the sums as Y^ Sl ...s 2 ~^ 1> 
G(sq, Si) . . . G(s r -x, s r ) — > 1 and B SQSl . . . B Sr _ lSr — > 1 (and similarly when r' = 0). 
Since the kernel S(so, s' ) is symmetric, we may without loss of generality choose s' > so. 
Dependent on the whether any or both of the indices (r, r') are zero, we have to evaluate 
the following averages (with the short-hand Sij — 1 — <Jy): 

• r = r' = 0: here the average of the last line in (jA.lOjl reduces to 

(• • )b = (B 2 S0S ,) = (B S0S ,) = 5 S0S , + h SoS , o (A.11) 

• r' — 0, r > 0: here the average in (jA.lO)) reduces to two terms (representing the 
cases s = s' Q versus s < s' Q ), which are both of the form (jA.3|) . 

(...) B =((B SoS1 ...B Sr _ 1Sr )B SrS ,B sos , o ) =p- r 6 S0S ,+p- r -% 0S , (A.12) 

where we used B^k = 1, for any k. The case r = 0, r' > is clearly equivalent. 
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• r, r' > 0: now the relevant average reduces to that of (jA.4|) . 

(• • )b = {[Bbosi ■ ■ ■ Bs r - lSr }Bs r s> r , [^s' s' x ■ ■ ■ ^s' rl _ lS ' rl ]B SoS ' o ) 
T r T r ' S ,- r -r'-l 

= P LmLf0 ^ (A. 13) 

Expression (|A.13J) reduces to those derived for the cases where r or r' is zero (or both), 
so it is true for any (r, r'). We may thus insert (jA.13|) into (jA.lOjl . and obtain: 

E( So , s ' ) = hm yi (-<wr +r ' e e D ^ s' rl ) n fi [i + (p - 1)<M 



r,r'>0 si-Sr s ;..i, i=0j=0 

x G(s , si) . . . G(a r _i, s r ) G(s' , s[) . . . G(s' r ,_ 1 , s' r ,) 
lira £ (-^Y +r ' E E ^,40 



N—>oo 



SI— Sr s' v ..s' , 



r,r'>0 

i= 0j =0 1 I6 N 



X G{Sq, Si) . . . Cr(s r _i, s r ) G(s , S-l ) ■■■G{s' r ,_ 1 ,s' rl ) 

= Yi (- i ) r+r / ds i ■ ■ ■ ds rds[ . . . ds' r , n n I 1 + ?v8[si - 4 

r,r>>0 i=0j=0 ^ 

x G(s , si) . . . G(s r _i, s r ) G(sq, si) . . . G{s' r ,_ 1 , s' r ,) 

x [1+C(s r ,s' r ,)) (A.14) 

Also ()A.14|) is identical to the corresponding expression in JTU], as it should. 



Appendix B. Expansion of bid sign recurrence probabilities 

Here we derive the expansion f)106j) of the function $(<7i, #2, • • •) as defined in (jUUj) . We 
abbreviate 

t 1 - O^A 



£ A = Erf 



-V2^ 2 + (l-C) 2 4- 
with Sa^A^A = (-^ r )- These short-hands allow us to compactify (PUJ) to 



(B.l) 



^n[E-A(i+^' 

i>i A 

1 . 3j 



+ ^n[E-Aa-^: 



<ij 



9j 



i>! A 



<^>Hn[E 

j>l n=0 
1 

2 



9j 



l) n (E n ) 



n[E 

j>l n =0 
9i 92 

E E • • • ( I 1 I ( ! 2 I • • • 77 t 1 + (-l) ni+n2+ -](E ni )(E n2 ) . . 

ni=0 ri2=0 

(B.2) 



Since the overall average bid in the MG is equally likely to be positive than negative, 
and since ()B.1|) tells us that sgn[S^] = sgn[A^], the moments (E r ) for even values of r 
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will have to be zero. From this it follows that 
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E E ••• 



9i 



92 



0<ni<ic/i 0<n 2 <ig2 



2ni J \ 2n 2 



^ ...(E 2ni )(E 2n2 ) ... 



n 

i>i 



E 



9j 
2n 



(E 2n ) 



so 



log $(01,02, 



Ei°g[i+ E 



l<n<g 3 /2 



9j \ , E 2n, 

2n r 



Equation ()B.3|) tells us, firstly, that 



$(1,1,1,...) = ! 



(B.3) 



(B.4) 



For arbitrary history coincidence numbers (gi,g 2 , . . .), not necessarily all equal to one, 
we may expand (|B.3)1 in the moments (E r ): 



log$( 9l ,g 2 , . . .) = J>g [l + l 9j fa -1)(E 2 ) + Lg.fa -l)( gj - 2 )( 9j -3)(E 4 ) 

+ 0((£ 6 » 

= \ E9j(9j "I) {(E 2 ) + ^[( 9j -2)(gj -3)(£ 4 > - 3 gj ( gj -l)(E 2 } 2 } } + 0((£ 6 » 

Finally, in leading order in E we may regard the variables E^ as proportional to A^, 
and therefore as distributed in a Gaussian manner. This implies (since (E) = 0) that 
in leading order we have (E 4 ) = 3(E 2 ). Hence 

log <&(<?!, <?2,...) = ~<£ 2 >E&(& - 1) - TWEft-Cft - W 2 *- - 3 ) 



+ 0((E 6 )) 



(B.5) 



Appendix C. Combinatorics in history frequency moments 

In this appendix we calculate the combinatorial factors G k, f as defined in (jlllj) . They 
can be obtained by differentiation of a simple generating function: 

G k a f = R- k t k f( k )( k ~ gi )(R- ir 9l - 92 9l9 b 2 

» 1= qn=0 \9i J \ 92 ) 



R~ k lim ( x ^-) a (y^-) b (R - 2 + x + y) k 
x,y-*i dx ay 



)-2 



In particular: 

G k f = kR' 1 + k{k - l)R- 
G k f = kR- 1 + 3k{k - 1)R- 2 + kik - \)(k - 2)iT 3 



(C.l) 

(C.2) 
(C.3) 



G k 4 f = kR- 1 + 7k{k - l)R~ 2 + 6k{k - l)(jfe - 2)R- 3 + kik - l)(k - 2){k - 3)iT 4 (C.4) 
G k f = k{k - l)R- 2 + 2kik - l){k - 2)R- 3 + kik - l)(k - 2){k - 3)iT 4 (C.5) 



